DNA Sequence Complexity Reveals Structure Beyond GC Content in Nucleosome Occupancy
نویسندگان
چکیده
We introduce methods that rapidly evaluate a battery of informationtheoretic and algorithmic complexity measures on DNA sequences in application to potential binding sites for nucleosomes. The first application of this new tool demonstrates structure beyond GC content on DNA sequences in the context of nucleosome binding. We tested the measures on well-studied genomic sequences of size 20K and 100K bps. The measures reveal the known in vivo versus in vitro predictive discrepancies, but they also uncover the internal structure of G and C within the nucleosome length, thus disclosing more than simply GC content when one examines alphabet transformations that separate and scramble the GC content signal and the DNA sequence. Most current prediction methods are based upon training (e.g. k-mer discovery), the one here advanced, however, is a training-free approach to investigating informative measures of DNA information content in connection with structural nucleosomic packing.
منابع مشابه
DNA-methylation effect on cotranscriptional splicing is dependent on GC architecture of the exon-intron structure.
DNA methylation is known to regulate transcription and was recently found to be involved in exon recognition via cotranscriptional splicing. We recently observed that exon-intron architectures can be grouped into two classes: one with higher GC content in exons compared to the flanking introns, and the other with similar GC content in exons and introns. The first group has higher nucleosome occ...
متن کاملNoncoding DNA, isochores and gene expression: nucleosome formation potential
The nucleosome formation potential of introns, intergenic spacers and exons of human genes is shown here to negatively correlate with among-tissues breadth of gene expression. The nucleosome formation potential is also found to negatively correlate with the GC content of genomic sequences; the slope of regression line is steeper in exons compared with noncoding DNA (introns and intergenic space...
متن کاملWhole-genome comparison of Leu3 binding in vitro and in vivo reveals the importance of nucleosome occupancy in target site selection.
Sequence motifs that are potentially recognized by DNA-binding proteins occur far more often in genomic DNA than do observed in vivo protein-DNA interactions. To determine how chromatin influences the utilization of particular DNA-binding sites, we compared the in vivo genome-wide binding location of the yeast transcription factor Leu3 to the binding location observed on the same genomic DNA in...
متن کاملActive nucleosome positioning beyond intrinsic biophysics is revealed by in vitro reconstitution.
Genome-wide nucleosome maps revealed well-positioned nucleosomes as a major theme in eukaryotic genome organization. Promoter regions often show a conserved pattern with an NDR (nucleosome-depleted region) from which regular nucleosomal arrays emanate. Three mechanistic contributions to such NDR-array-organization and nucleosome positioning in general are discussed: DNA sequence, DNA binders an...
متن کاملDNA physical properties determine nucleosome occupancy from yeast to fly
Nucleosome positioning plays an essential role in cellular processes by modulating accessibility of DNA to proteins. Here, using only sequence-dependent DNA flexibility and intrinsic curvature, we predict the nucleosome occupancy along the genomes of Saccharomyces cerevisiae and Drosophila melanogaster and demonstrate the predictive power and universality of our model through its correlation wi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1708.01751 شماره
صفحات -
تاریخ انتشار 2017